@liamlego liamlego commented Oct 7, 2025

Description

This PR fixes the stac-geoparquet exports of the collections, particularly those that use partitioned export, by adopting the optimizations made to the stac-geoparquet package in stac-utils/stac-geoparquet#109.
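
For context on what a partitioned export involves, here is a minimal sketch that groups STAC items into monthly partitions by their `datetime` property. It is not the code from this PR or from stac-geoparquet: the helper names `partition_key` and `group_items_by_month`, the monthly granularity, and the sample items are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime


def partition_key(item: dict) -> str:
    """Return a 'YYYY-MM' key from a STAC item's properties.datetime (hypothetical helper)."""
    raw = item["properties"]["datetime"]
    # STAC datetimes are RFC 3339 strings; map a trailing 'Z' to an explicit
    # UTC offset so datetime.fromisoformat accepts it on older Python versions.
    dt = datetime.fromisoformat(raw.replace("Z", "+00:00"))
    return f"{dt.year:04d}-{dt.month:02d}"


def group_items_by_month(items):
    """Group items into one list per calendar month (assumed partitioning scheme)."""
    partitions = defaultdict(list)
    for item in items:
        partitions[partition_key(item)].append(item)
    return dict(partitions)


# Made-up sample items, only to show the grouping.
items = [
    {"id": "a", "properties": {"datetime": "2025-01-03T10:00:00Z"}},
    {"id": "b", "properties": {"datetime": "2025-01-20T10:00:00Z"}},
    {"id": "c", "properties": {"datetime": "2025-02-01T00:00:00Z"}},
]
partitions = group_items_by_month(items)
for key in sorted(partitions):
    print(key, [item["id"] for item in partitions[key]])
```

In a real export, each group would presumably be written to its own parquet file, so a large collection such as sentinel-2-l2a never has to be held in memory at once.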

Fixes # (issue)

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

Checklist:

Please delete options that are not relevant.

  • I have performed a self-review
  • Changelog has been updated
  • Documentation has been updated
  • Unit tests pass locally (./scripts/test)
  • Code is linted and styled (./scripts/format)

@liamlego liamlego marked this pull request as ready for review October 31, 2025 18:15

Copilot AI left a comment

Pull Request Overview

This PR significantly refactors the STAC-geoparquet workflow to improve performance and functionality. The changes focus on updating dependencies, implementing custom export logic, and adding new collection support.

  • Updates the container image version and workflow configuration for test and production environments
  • Replaces reliance on external stac-geoparquet runner with custom implementation for better control
  • Adds comprehensive support for HLS2 collections with geoparquet asset definitions

Reviewed Changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| datasets/stac-geoparquet/workflow_test.yaml | Updated image version and changed test collection from io-lulc-annual-v02 to sentinel-2-l2a |
| datasets/stac-geoparquet/workflow.yaml | Updated image version, renamed job from "stac" to "geoparquet", changed task ID from "create" to "update", and updated configuration table reference |
| datasets/stac-geoparquet/test.ipynb | Added new test notebook with sample code for validating geoparquet exports |
| datasets/stac-geoparquet/requirements.txt | Major dependency updates including custom git branch and potentially invalid future-dated versions |
| datasets/stac-geoparquet/pc_stac_geoparquet.py | Complete rewrite with custom collection export logic, partitioning support, and rendering configuration |
| datasets/stac-geoparquet/README.md | Updated documentation with local development instructions and corrected typo |
| datasets/stac-geoparquet/Dockerfile | Modernized base image to Azure Linux Python 3.12 with UV package manager |
| datasets/hls2/collection/hls2-s30/template.json | Added license information and geoparquet-items asset definition |
| datasets/hls2/collection/hls2-l30/template.json | Added license information and geoparquet-items asset definition |

committing copilot suggestions

Co-authored-by: Copilot <[email protected]>

liamlego commented Oct 31, 2025

I confirmed this with pctasks workflow ID 19229c03-98e0-4a33-ba0d-3970ac612fd7. It ran only on sentinel-2-l2a.
